Botorch preset #757
Conversation
Pull request overview
Adds a new BOTORCH Gaussian Process preset intended to exactly mirror BoTorch GP behavior (single- and multi-task), alongside supporting kernel-factory refactors and validation improvements.
Changes:
- Introduce `GaussianProcessPreset.BOTORCH` and implement BoTorch-aligned kernel / mean / likelihood factories (including custom GPyTorch components for multitask mean + likelihood).
- Add parameter-kind classification (`ParameterKind`) and automatic parameter-kind validation for kernel factories via a new `_KernelFactory` base.
- Extend tests (BoTorch parity + kernel-factory validation) and update the Streamlit surrogate-model demo to expose GP presets and multitask/transfer-learning toggles.
Reviewed changes
Copilot reviewed 18 out of 18 changed files in this pull request and generated 11 comments.
| File | Description |
|---|---|
| `baybe/surrogates/gaussian_process/presets/botorch.py` | Adds BoTorch preset factories for kernel/mean/likelihood. |
| `baybe/surrogates/gaussian_process/components/_gpytorch.py` | Introduces a custom GPyTorch mean and a multitask likelihood helper to match BoTorch behavior. |
| `baybe/surrogates/gaussian_process/components/kernel.py` | Adds `_KernelFactory` with parameter-kind validation; adjusts `ICMKernelFactory`. |
| `baybe/surrogates/gaussian_process/presets/{edbo,edbo_smoothed}.py` | Migrates kernel factories to `_KernelFactory` and updates dimension logic. |
| `baybe/surrogates/gaussian_process/presets/{core,__init__.py}` | Adds the `BOTORCH` enum entry and re-exports preset factories. |
| `baybe/parameters/{enum,base.py}` | Adds `ParameterKind` and exposes `Parameter.kind`. |
| `baybe/parameters/selectors.py` | Removes `_ParameterSelectorMixin` (superseded by `_KernelFactory`). |
| `tests/test_gp.py` | Adds a parity test asserting the `BOTORCH` preset reproduces BoTorch posterior statistics. |
| `tests/test_kernel_factories.py` | Adds tests for kernel-factory parameter-kind validation. |
| `streamlit/surrogate_models.py` | Adds a GP preset selector plus multitask / transfer-learning UI and logic. |
| `docs/conf.py` | Updates the nitpick ignore list for the refactor. |
| `CHANGELOG.md` | Documents the new presets and kernel-factory parameter-kind validation. |
Comments suppressed due to low confidence (1)
baybe/surrogates/gaussian_process/presets/edbo.py:85
`effective_dims` is computed as `train_x.shape[-1]`, which includes the task feature column in transfer-learning searchspaces. Since this factory explicitly excludes `TaskParameter` via `parameter_selector`, the priors should likely be based on the non-task dimensionality (i.e., exclude the task feature) to avoid skewing the dimension-dependent prior heuristics. Consider deriving `effective_dims` from the selected non-task dimensions instead (e.g., subtract task dims or count comp-rep columns for selected parameters).
```python
effective_dims = train_x.shape[-1]
switching_condition = _contains_encoding(
    searchspace.discrete, _EDBO_ENCODINGS
) and (effective_dims >= 50)
```
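To illustrate the suggested fix, here is a minimal hypothetical sketch (the helper name and the numbers are illustrative, not the actual baybe API): the dimension-dependent prior heuristics would be driven by the count of non-task columns rather than the raw comp-rep width.

```python
# Hypothetical helper (illustrative, not part of baybe): base the
# dimension-dependent prior heuristics on non-task columns only.
def effective_non_task_dims(n_total_dims: int, n_task_dims: int) -> int:
    """Subtract task-feature columns from the total comp-rep width."""
    return n_total_dims - n_task_dims


# A transfer-learning searchspace with 51 comp-rep columns, one of which
# is the task feature, would then use 50 dims for the prior heuristics:
print(effective_non_task_dims(51, 1))  # 50
```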
```python
base_idcs = (
    set(dims)
    if (dims := base_kernel.active_dims.tolist()) is not None
    else None
)
task_idcs = (
    set(dims)
    if (dims := task_kernel.active_dims.tolist()) is not None
    else None
```
base_kernel.active_dims / task_kernel.active_dims can be None in GPyTorch kernels. Calling .tolist() unconditionally will raise an AttributeError (NoneType has no attribute tolist). Consider guarding with an is not None check (or using getattr(..., None)) before converting to a set, and treat None explicitly in the validation logic.
Suggested change:

```diff
-base_idcs = (
-    set(dims)
-    if (dims := base_kernel.active_dims.tolist()) is not None
-    else None
-)
-task_idcs = (
-    set(dims)
-    if (dims := task_kernel.active_dims.tolist()) is not None
-    else None
+base_active_dims = base_kernel.active_dims
+task_active_dims = task_kernel.active_dims
+base_idcs = (
+    all_idcs
+    if base_active_dims is None
+    else set(base_active_dims.tolist())
+)
+task_idcs = (
+    all_idcs
+    if task_active_dims is None
+    else set(task_active_dims.tolist())
```
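A standalone sketch of this None-safe pattern (a plain list stands in for GPyTorch's `active_dims` tensor so the example is self-contained; in GPyTorch, `None` means the kernel acts on all dimensions):

```python
# Sketch only: `active_dims` is a Tensor or None in GPyTorch; a plain
# list of ints stands in for the tensor here.
def to_index_set(active_dims, all_idcs):
    if active_dims is None:
        # None means "the kernel acts on all dimensions"
        return set(all_idcs)
    return set(active_dims)  # Tensor.tolist() in the real code


all_idcs = {0, 1, 2}
print(sorted(to_index_set(None, all_idcs)))    # [0, 1, 2]
print(sorted(to_index_set([0, 1], all_idcs)))  # [0, 1]
```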
```python
0 if st_enable_multitask else 1,
20,
0 if st_enable_multitask else 5,
```
When Multi-task is enabled, the on-task training-point slider allows 0. In the st_use_separate_gps branch (transfer learning unchecked), the on-task recommender is still run on the on-task subset, which will be empty and will likely raise during GP fitting/recommendation. Consider enforcing a minimum of 1 on-task point when st_use_separate_gps is True, or add an explicit guard that shows a Streamlit error and stops early when the on-task dataset is empty.
Suggested change:

```diff
-0 if st_enable_multitask else 1,
-20,
-0 if st_enable_multitask else 5,
+1,
+20,
+5,
```
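Alternatively, the explicit-guard option could look roughly like this (the function name and message are hypothetical; in the Streamlit app the failure branch would call `st.error(...)` and `st.stop()` instead of raising):

```python
# Hypothetical guard (not the actual app code): refuse an empty on-task
# subset when separate GPs are fitted, i.e. transfer learning is disabled.
def check_on_task_points(n_on_task: int, use_separate_gps: bool) -> None:
    if use_separate_gps and n_on_task < 1:
        raise ValueError(
            "At least 1 on-task training point is required when fitting "
            "separate GPs (transfer learning disabled)."
        )


check_on_task_points(n_on_task=5, use_separate_gps=True)  # passes silently
try:
    check_on_task_points(n_on_task=0, use_separate_gps=True)
except ValueError as e:
    print(f"caught: {e}")
```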
```python
if base_idcs is not None and (base_idcs > allowed_base_idcs):
    raise ValueError(
```
The subset check for base-kernel active_dims is incorrect: base_idcs > allowed_base_idcs checks for a strict superset, not “not a subset”. This will miss invalid cases (e.g. {0, task_idx}) and potentially flag none. Use a proper subset validation (e.g. not base_idcs <= allowed_base_idcs) and consider a clearer error if active_dims is None (meaning “all dims”).
Suggested change:

```diff
-if base_idcs is not None and (base_idcs > allowed_base_idcs):
-    raise ValueError(
+if base_idcs is None:
+    raise ValueError(
+        "The base kernel's 'active_dims' must be restricted to the non-task "
+        f"indices {allowed_base_idcs}; got None, which means all dimensions."
+    )
+if not base_idcs <= allowed_base_idcs:
+    raise ValueError(
```
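A quick demonstration of the operator mix-up (index values made up for illustration): on Python sets, `>` tests for a strict superset, so a set containing a disallowed index slips through, while the negated subset test catches it.

```python
allowed_base_idcs = {0, 1, 2}  # non-task dims (illustrative values)
base_idcs = {0, 3}             # includes a disallowed (task) index

# Buggy check: "is base_idcs a strict SUPERSET of the allowed set?"
print(base_idcs > allowed_base_idcs)       # False -> invalid set not flagged

# Correct check: "is base_idcs NOT a subset of the allowed set?"
print(not base_idcs <= allowed_base_idcs)  # True -> invalid set flagged
```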
AVHopp
left a comment
The Streamlit changes should be a separate PR, as they have nothing to do with the new presets. I do see the value in extending the Streamlit app, but those two things should still not be combined, imo.
AVHopp
left a comment
Two bugs or uncaught errors in the Streamlit app:
```python
0 if st_enable_multitask else 5,
)
if st_enable_multitask:
    st_n_training_points_other = st.sidebar.slider(
```
I got a nasty uncaught ValueError here when activating Multi-Task under Domain, setting Number of training points to some value > 0, setting Number of off-task training points to 0, and de-activating Transfer Learning under Model. This should be a more meaningful error (unless it demonstrates a real issue, in which case it should be fixed):
So the question to me is how we want/should go about this in general. The Streamlit app was never intended as a user-facing application but is for development only. I'm thus very hesitant to apply all sorts of input-validation steps to something that is pretty much a moving target.
Scienfitz
left a comment
The sneaky global change to `mll` is not acceptable.
Criterion is a BayBEGPComponent (check is already included)

DevPR, parent is #745
Adds the `BOTORCH` preset for GPs.

Important information

Merely claiming "this is the BoTorch behavior" seems like a bad idea; the test ensures this explicitly.